Genome sequence data of Lactiplantibacillus plantarum IMI 507028

Here, we report the genome sequencing data for the fermented milk isolate, Lactiplantibacillus plantarum (L. plantarum) IMI 507028. The Bioproject, SRA, and GenBank data were deposited at NCBI under accession numbers PRJNA801616, SRR18323693, and JAKMAX000000000, respectively. The size of the genome was 3,231,321 bp, with a GC% of 44.52. Before sequence trimming, the genome contained 40 contigs, in which 35 contigs were annotated, revealing 2937 coding sequences out of 3052 total genes. The strain was identified as L. plantarum with an average nucleotide identity (ANI) value of 99.9922% between IMI 507028 and L. plantarum JDM1. Genes related to antimicrobial resistance or pathogenic factors were not found during screening.


Specifications
Microbiology Specific subject area Microbial genomics Type of data Raw reads, assembled and annotated data from the L. plantarum IMI 507028 genome sequencing How the data were acquired Illumina NovaSeq 60 0 0, Unicycler v 0. 4

Value of the Data
• Lactiplantibacillus plantarum is widely isolated from fermented foods and plant sources, and is versatile for its use in products aimed at food and feed preservation. • Lactiplantibacillus plantarum IMI 507028 is a strain that can find applications in the agri-food sector; therefore, its characterization is important in terms of identity and safety. • The genome data presented herein could enrich safety-related knowledge of the lactic acid bacteria, that could be used for comparative purposes.

Data Description
Lactiplantibacillus plantarum ( L. plantarum ) IMI 507028 genome data, taxonomic identification, genome searches for antimicrobial resistance, and virulence (pathogenic) factors are described.
The analyzed genome sequencing data produced an annotated assembly of 35 contigs with a sequence length of 3,230,670 bp, GC% of 44.52, N50 contig length of 366,540 bp, and genome coverage of 529x. The total number of genes after annotation was 3052 genes, of which 2937 were coding sequences, 2 were ribosomal RNAs, and 59 were transfer RNA.
Genome comparison performed with alignment-free genome distance estimation with Mash using MinHash [1] showed the best hit (low distance and high matching) for L. plantarum JDM1 ( Table 1 ).
The alignment-based calculation of ANI is the most widely used comparative measurement for overall genome relatedness indices (OGRI) between two genomes to define boundaries between species with a proposed species boundary cut-off of 95-96% [2] .
The genomes sequences of the indicated species included in the orthoANI genome analyses are shown in Table 2 .
Data from the OrthoANI comparison [3] of IMI 507028 with closely related Lactiplantibacillus strains are shown in Table 3 . Pairwise comparisons indicated that the IMI 507028 genome shared 99.9922% identity with the strain L. plantarum JDM1. The cut-off value for species identification was set at 95% [4] .
To assess the antibiotic resistance genes present in the sequenced genome IMI 507028, three different databases were used, and no hits with ≥80% identity and 70% coverage were identified, with criteria and thresholds proposed by the European Food Safety Authority (EFSA) for whole Table 1 Taxonomic identification of IMI 507028 used as query-ID with Mash, a fast genome distance estimation using MinHash. The search was made for the closest matching NCBI RefSeq genomes. Mash distance approximates the mutation rate. The sketches are set to include 400 hashes (derived from k-mers of size 16) selected randomly from each genome for pairwise comparison. For example, IMI 507028 and the top hit L. plantarum JDM1 share 398 out of the 400 hashes. genome sequencing data analysis [4] . In addition, no hits were identified for toxin and virulence factor genes potentially carried by IMI 507028 when screened against an updated virulence factor database (VFDB). The search for potential plasmids was performed using PlasmidFinder, and contig 23 showed a partial match with L. plantarum subsp. plantarum P-8 plasmid LBPp6.

Experimental Design, Materials and Methods
Detailed genome sequencing data analysis was performed as previously described [5] except for the annotation performed using NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v6.0 [6] . Sequencing produced 6,141,736 raw reads and 1,854,804,272 base pairs. After trimming 5,850,376 paired reads, 1,708,275,654 base pairs were obtained.

Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.